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GTSYS.003A PATENT 
METHOD FOR GENERATING TRANSCRIPTIONALLY ACTIVE DNA 

FRAGMENTS 



5 Related Applications 

This application claims priority under 35 U.S.C. §1 19(e) to U.S. Provisional 
Application Serial No. 60/125,953, filed March 24, 1999. 



Background of the Invention 

10 Field of the Invention 

The present invention relates to a method for generating transcriptionally active 
DNA fragments. More specifically, the method relates to synthesis of a DNA fragment 
by polymerase chain reaction (PCR) using nested primers, promoter sequences and 
terminator sequences. 

15 

Description of the Related Art 

In addition to the tremendous progress made in the past few years in sequencing 
the human genome, efforts have also been made to sequence other organisms that are of 
biomedical importance. For example, complete genomic sequences have been obtained 
20 for Borrelia burgdorferi (cause of Lyme disease), Chlamydia, Heliobacter pylori and 

Mycobacterium tuberculosis. The fast-growing sequence information provides 
immense opportunities to reveal the basic biology of related organisms at the 
gene/molecular level and to develop novel therapeutics or vaccines against various 
pathogens. 

25 However, this vast sequence information also mandates a much more efficient 

and streamlined way to screen and identify genes of interest from tens of thousands of 
candidate genes. The conventional approach to gene screening and identification 
involves generation of a cDNA library, subcloning the DNA inserts into plasmid vectors 
(expression vectors), purifying plasmid DNA from bacteria for each individual cDNA 

30 clone and transfecting animal cells or tissues for functional analysis of the encoded gene 
product. This method, even in conjunction with the use of polymerase chain reaction 
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(PCR) to generate cDNA fragments to allow directional and in-frame cloning, is still 
time consuming, costly and difficult to automate. 

The present invention provides a simple, rapid method for the generation of 
transcriptionally active DNA fragments. 



One embodiment of the present invention is a method for generating a 
transcriptionally active DNA molecule, comprising polymerase chain reaction (PCR) 
amplification of said DNA molecule in the presence of a first DNA fragment (Fl), 
second DNA fragment (F2), first primer (PI), a second primer (P2), a third primer (P3) 
and a fourth primer (P4) wherein: Fl comprises a promoter sequence; F2 comprises a 
terminator sequence; PI is complementary to the 5' end of Fl; P2 is complementary to 
the 3' end of F2; P3 comprises a first region complementary to the 3' end of Fl and a 
second region complementary to the 5' end of said DNA molecule; P4 comprises a first 
region complementary to the 5' end of F2 and a second region complementary to the 3' 
end of said DNA molecule, whereby a transcriptionally active DNA molecule is 
produced by said PCR amplification. Preferably, Fl is the cytomegalovirus IE 
promoter. In one aspect of this preferred embodiment, the transcriptionally active DNA 
molecule encodes a therapeutic gene. The method may further comprise the step of 
adding a PNA tail to the 5'-end of PI and P2 prior to the PCR amplification. 
Preferably, a thymidine base immediately precedes the region of complementarity 
between the third primer P3 and the first DNA fragment Fl. In another aspect of this 
preferred embodiment, a thymidine base immediately precedes the region of 
complementarity between the fourth primer P3 and the second DNA fragment F2. The 
method may also further comprise the step of adding a PNA clamp to said 
transcriptionally active DNA molecule after said PCR amplification. Preferably, the 
method further comprises the step of adding a PNA molecule via a linker (PNA clamp 
tail) to primers PI and P2 prior to the PCR amplification. 



Figure 1 is a schematic diagram of the gene amplification method of the present 
invention. Oligonucleotide primers PI and P2 are complementary to the 5' end of a 
promoter sequence (Fl) and the 3' end of a terminator sequence (F2), respectively. P3 
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and P4 are oligonucleotide primers which contain regions complementary to opposite 
ends of Gene X. In addition, P3 and P4 contain regions which are complementary to the 
3' region of Fl and the 5' region of F2. An excess amount of primers PI and P2 are 
combined with Gene X, or with a mixture of DNA containing Gene X, and with Fl, F2, 
5 P3 and P4, or with a mixture of DNA containing Fl, F2 and Gene X which has been 
PCR amplified using P3 and P4, and subjected to PCR to produce a transcriptionally 
active linear DNA fragment containing Fl, F2 and gene X. 

Figure 2 is a schematic diagram showing DNA fragment Fl which contains a 
thymidine base immediately preceding the region of complementarity with the PCR 
1 0 intermediate (primer P3). 

Figure 3 is a schematic diagram showing the use of peptide nucleic acid (PNA) 
sequences and PNA clamps to protect the ends of linear expression DNA fragments. 
3 Primers PI and P2 contain a PNA tail which is resistant to proteolytic and exonuclease 

~ degradation. The resulting linear expression DNA fragment contains a PNA tail at each 

^ 15 of the 5 '-ends. The 3' ends are protected by addition of a PNA "clamp" which is also 

tj resistant to proteolytic and exonuclease degradation. 

y Figure 4 is a schematic diagram showing the use of a PNA "clamp" tail to 

L protect the 5' ends of the linear expression DNA fragment, followed by formation of a 

ij clamp during PCR to protect the 3' ends. 

J 20 

« Detailed Description of the Preferred Embodiments 

The present invention provides a simple, efficient method for generating 
transcriptionally active DNA fragments which can be readily transfected into animal 
cells or tissues by conventional nucleic acid transfection techniques, without the need 
25 for subcloning into expression vectors and purification of plasmid DNA from bacteria. 

The transcriptionally active DNA fragments are synthesized by polymerase chain 
reaction (PCR) amplification of any gene of interest using nested oligonucleotide 
primers and two DNA fragments, one of which comprises an active transcription 
promoter sequence and one of which comprises a basic transcription terminator element. 
30 A first primer is complementary to the DNA fragment comprising the promoter; 

a second primer is complementary to the DNA fragment comprising the terminator; a 
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third primer is complementary to both the promoter sequence and one end of the gene of 
interest; and a fourth primer is complementary to both the terminator sequence and the 
other end of the gene of interest. These promoters, genes and terminators are linked in 
an expression cassette as shown in Figure 1. In addition, the ends of linear DNA 
5 containing the expression cassette can be protected from exonuclease digestion during 
and after transfection by incorporating peptide nucleic acid (PNA) sequences and PNA 
clamps as shown in Figures 3 and 4. 

As used herein, the term "promoter" is a DNA sequence which extends upstream 
from the transcription initiation site and is involved in binding of RNA polymerase. 

10 The promoter may contain several short (<10 base pair) sequence elements that bind 
transcription factors, generally dispersed over >200 base pairs. A promoter that 
contains only elements recognized by general and upstream factors is usually 
transcribed in any cell type. Such promoters may be responsible for expression of 
cellular genes that are constitutively expressed (sometimes called housekeeping genes). 

15 There are also tissue-specific promoters limited to particular cell types, such as the 
human metallothionein (MT) promoter which is upregulated by heavy metal ions and 
glucocorticoids. 

As used herein, the term "terminator" is a DNA sequence represented at the end 
of the transcript that causes RNA polymerase to terminate transcription. This occurs at 
20 a discrete site downstream of the mature 3' end which is generated by cleavage and 
polyadenylation. 

The present method can be used to quickly generate nuclease-resistant and 
transcriptionally active linear DNA molecules which express any desired gene with any 
known sequence. The linear DNA can then be delivered into animal cells or tissues for 
25 functional analysis, vaccination and other pharmaceutical applications. This method 

also avoids problems associated with bacterial growth such as toxicity and stability. 
This method can also be completely automated for use in high-throughput screening 
methods. 

Referring now to Figure 1, oligonucleotide primers PI and P2 are 
30 complementary to the 5' region of DNA fragments Fl and F2, respectively. Fragment 
Fl comprises a transcription promoter (darkened region) and fragment F2 comprises a 
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transcription terminator (darkened region). Primers P3 and P4 are complementary to 
the 3' ends of Fl and F2, respectively, and to different ends of the DNA fragment 
containing Gene X. Figure 1 shows the putative intermediates and the final product 
which are formed in the present PCR-based method when PI, P2, P3, P4 , Fl and F2 are 
5 added to a DNA template comprising Gene X. It should be noted that amount of 

primers PI and P2 present in the reaction mixture are preferably at least about 100 fold 
greater than the amounts of P3, P4, Fl and F2, to minimized the amounts of 
intermediate products generated during the reaction. To generate the first intermediate, 
primer P3 and P4 are used which amplify from the promoter and terminator sequences 

10 of Fl and F2 across Gene X, resulting in an intermediate having the promoter and 
terminator sequences on the ends and Gene X in between these sequences. 

The second intermediate is formed by hybridization of the first intermediate with 
fragments Fl and F2 via their complementary promoter and terminator sequences 
(darkened regions), followed by PCR amplification from primers P3 and P4, resulting in 

1 5 a DNA segment comprising the entire F 1 , F2 and Gene X. 

In the last step of the reaction, PCR amplification of the second intermediate 
using primers PI and P2 results in amplified amounts of the complete transcriptionally 
active DNA fragment. 

The method of the invention may be performed by adding all of the components 

20 in a single reaction mixture. Alternatively, two separate PCR reactions can be 
performed. In the first reaction, the template gene, P3 and P4 are used first. The 
product of this reaction is then used as a template for a second PCR reaction involving 
fragments Fl and F2, plus primers PI and P2. 

The generation of final products using the nested PCR method describe above is 

25 dependent on the sequences of the Fl or F2 fragments at the junction between the 

region overlapping P3 and P4, respectively. This is due, at least in part, to the addition 
of an extra adenosine base (A) at the 3' end of the PCR fragment by Taq DNA 
polymerase which is commonly used in PCR protocols. This could produce a mismatch 
between the PCR intermediate generated by P3/P4 and fragments Fl and F2. 

30 Thus, in a preferred embodiment, the overlap between the PCR intermediate (P3 

primer) and Fl, and P4/F2, is designed such that a thymidine base (T) immediately 
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precedes the overlap region in fragments Fl and F2 (Fig. 2, only Fl shown). In this 
case, even with the addition of an "A" base to the 3' end to the intermediate during PCR 
amplification, the overlap between F1/F2 and the respective PCR intermediates is still 
perfectly maintained. 

5 Peptide nucleic acids (PNA) are nucleic acid analogs in which the entire 

deoxyribose-phosphate backbone has been exchanged with a chemically completely 
different, but structurally homologous, polyamide (peptide) backbone containing 2- 
aminoethyl glycine units. Unlike DNA, which is highly negatively charged, the PNA 
backbone is neutral. Therefore, there is much less repulsive energy between 

10 complementary strands in a PNA-DNA hybrid than the comparable DNA-DNA hybrid, 
and consequently they are much more stable. 

In addition, molecules called PNA "clamps" have been synthesized which have 
two identical PNA sequences joined by a flexible hairpin linker containing three 8- 
amino-3,6-dioxaoctanoic acid units. When a PNA clamp is mixed with a 

15 complementary homopurine or homopyrimidine DNA target sequence, a PNA-DNA- 
PNA triplex hybrid can form which is extremely stable (Bentin et al., Biochemistry 
35:8863-8869, 1996; Egholm et al, Nucleic Acids Res. 23:217-222, 1995; Griffith et al., 
J. Am. Chem. Soc. 117:831-832, 1995). The sequence-specific and high affinity duplex 
and triplex binding of PNA have been extensively described (Nielsen et al, Science 

20 254:1497-1500, 1991; Egholm et al, J. Am. Chem. Soc. 114:9677-9678, 1992; Egholm 
et al., Nature 365:566-568, 1993; Almarsson et al., Proc. Natl. Acad. Sci. U. S A. 
90:9542-9546, 1993; Demidov et al., Proc. Natl. Acad. Sci. U. S A. 92:2637-2641, 
1995). They have also been shown to be resistant to nuclease and protease digestion 
(Demidov et al., Biochem. Pharm. 48:1010-1013, 1994). 

25 A representative PNA clamp is shown in Figs. 3 and 4. This PNA clamp, or a 

PNA clamp having any desired DNA sequence, is synthesized as described by Egholm 
et al. (supra.) or is available from PerSeptive Biosystems (Framingham, MA). The 
representative PNA clamp shown in Figs. 2 and 3 has the sequence 5'-TCTCTCTC-0- 
O-O-TJTJTJTJ (SEQ ID NO: 1), where J (pseudoisocytosine) is an analog of C, and the 

30 "O" residues are 8-amino-3,6-dioxaoctanoic acid linkers which separate the two regions 
of the PNA. 
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The use of PNA tails, PNA clamps and PNA "clamp tails" to protect the ends of 
tiig transcriptionally active PCR fragments which are described above is illustrated in 
{/^ Exgnre-At The ends of the PCR fragments generated as shown in Figure 1 are prone to 

digestion by exonucleases after transfection into a cell or tp^sue. Two strategies for 
5 inhibiting this degradation are shown in Figures 3 and 4. In ¥i^^o^ a PNA tail and a 

short DNA sequence (5'-GATC-0-TCTCTCTC-3'; SEQ ID NO: 2) is added to the 5' 
end of primers PI and P2 which bind to the target sequence 5'-GAGAGAGA-3' (SEQ 
ID NO: 3). This generates a PNA binding site at the 3' end of the opposite strand 
during PCR. Primers containing PNA tails can be synthesized using methods well 

10 known in the art or are available from PerSeptive Biosystems. These PNA tails do not 

hybridize to a target sequence, but protect the 5' -ends from exonuclease digestion. In 
order to protect the 5' ends of the PCR fragment, a PNA molecule is added to the 3'- 
PNA tail-protected PCR fragment. This PNA contains a DNA sequence which is 
complementary to the 3 '-end of the PCR fragment and binds thereto as shown in Fig. 3. 

15 An alternate PNA protection approach is shown in Fig. 4. In this method, a 

PNA clamp is linked to primers PI and P2 via one or more 8-amino-3,6-dioxaoctanoic 
acid linkers to form a PNA "clamp" tail by well known methods. After PCR 
amplification, the "clamp tail" binds to the 3' ends of the PCR fragment, and is linked 
to the 5' ends via the one or more 8-amino-3,6-dioxaoctanoic acid linkers to protect 

20 both the 3' and 5' ends of the PCR fragment. 

Although the use of the CMV IE promoter and an artificial mammalian 
transcriptional terminator elements are exemplified herein, the use of any eukaryotic 
promoter and terminator is within the scope of the present invention. Suitable 
promoters for use in the present invention include, for example, SV40, rous sarcoma 

25 virus (RSV), retroviral long terminal repeats (LTR), muscle creatine kinase promoter, 

actin promoter, elongation factor promoter, synthetic promoters, tissue-specific 
promoters, and the like. Suitable terminator sequences include SV40 transcription 
terminator, bovine growth hormone (BGH) terminator, synthetic terminators, and the 
like. These promoter and terminator sequences can be obtained by restriction enzyme 

30 digestion of commercially available plasmids and cDNA molecules, or can be 

synthesized using an automated DNA synthesizer using methods well known in the art. 
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Any desired gene may be amplified and coupled to active promoter and 
terminator sequences using the method of the present invention. In a preferred 
embodiment, the gene encodes a gene product which is absent or present at reduced 
levels in an organism. Nonlimiting examples of these gene products are the cystic 
fibrosis transmembrane regulator (CFTR), insulin, dystrophin, interleukin-2, 
interleukin-12, erythropoietin, gamma interferon, and granulocyte macrophage colony 
stimulating factor (GM-CSF). 

Although any transfection method well known in the art may be used to transfect 
the transcriptionally active PCR fragments of the invention into cells or tissues 
including calcium phosphate precipitation, electroporation and DEAE-dextran, cationic 
lipid-mediated transfection is preferred. Gene delivery systems are described by 
Feigner et al. (Hum. Gene Ther. 8:511-512, 1997) and include cationic lipid-based 
delivery systems (lipoplex), polycation-based delivery systems (polyplex) and a 
combination thereof (lipopolyplex). Cationic lipids are described in U. S. Patent Nos. 
4,897,355 and 5,459,127, the entire contents of which are hereby incorporated by 
reference. 

Example 1 

Production of transcriptionally active PCR fragments 
The following components are combined in a 100 jil reaction volume: DNA 
fragment Fl (4 ng) which comprises a region of high transcriptional potency (-240 to 
+60) from the human cytomegalovirus (CMV) immediate early gene (IE) 
promoter/enhancer, DNA fragment F2 (4 ng), a 55 base pair oligonucleotide encoding 
an artificial mammalian transcription terminator element (5'- 
CACAAAAAACCAACACACAGATCTCTAGAGCTCTGATCTTTTATTAGCCAG 
AAGT-3'; SEQ ID NO: 4), 400 ng primer PI (5'-TCTCTCTACGTATTAGTCATCG- 
3*; SEQ ID NO: 5), 400 ng primer P2 (5 ' -TC AC AAAAAACC AAC AC AC AG-3 ' ; SEQ 
ID NO: 6), 4 ng primer P3 and 4 ng primer P4. The PI and P2 primer sequences 
correspond to the 5' end of fragment Fl and the 5' end of fragment F2, respectively 
(Fig. 1). Primers P3 and P4 are designed based on the gene sequence of interest. The 3' 
potion of these primers (about 10-20 base pairs) is determined by the actual sequence of 
the gene to be amplified, whereas the 5' region of P3 comprises the following sequence: 
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5'-CTCCGCGGATCCAGA-3' (SEQ ID NO: 7) to overlap with the 3' region of Fl, and 
P4 comprises the sequence 5 '-TTATTAGCCAGAAGT-3 ' (SEQ ID NO: 8) to overlap 
with the 3' region of F2. 

PCR is performed as follows: denaturation for 30 seconds at 94°C, annealing 
for 45 seconds at 55°C and extension for three minutes at 72°C for 25-30 cycles. The 
size of the final produce is verified by 1% agarose gel electrophoresis. The amplified 
PCR fragment is cleaned and purified using a commercial PCR cleaning kit (e.g., 
Qiagen), and can be used for in vitro or in vivo transfection of cells or tissues. 



The green fluorescent protein (GFP, 700 bp) was used as a target gene. The Fl 
fragment was a 600 bp fragment of the CMV immediate early gene promoter (from -550 
to +50). A 50 bp oligonucleotide containing a modified rabbit beta-globin gene 
transcription terminator was used as primer 2. Two different versions of these primers 
were designed so that the overlap between Fl and the intermediate GFP PCR fragment 
was either preceded with or without a thymidine base. After the PCR reaction was 
carried out according to the conditions described in Example 1, the products were 
analyzed by electrophoresis. When a thymidine base preceded the overlap region in Fl, 
a clean major PCR fragment of about 1.4 kb was produced, representing the GFP coding 
region flanked by CMV promoter (Fl) and the transcription terminator, whereas if a 
base other than thymidine preceded the overlapping region, no product was generated. 
The fragment generated by nested PCR was then transfected into Cos-7 cells and the 
expression of GFP was monitored by fluorescence microscopy. The results showed that 
the intensity and frequency of GFP-expressing cells was almost the same between cells 
transfected with the PCR fragment and a supercoiled plasmid DNA in which the same 
CMV promoter was driving expression of the GFP gene. Thus, the present method 
produces DNA fragments which are transcriptionally active. 

While particular embodiments of the invention have been described in detail, it 
will be apparent to those skilled in the art that these embodiments are exemplary rather 
than limiting, and the true scope of the invention is that defined in the following claims. 



Example 2 
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